Implementing a Task Specific Grammar for Recognition and Parsing using the CPK NLP Suite for Spoken Language Understanding
Abstract
This paper describes how a task-specific grammar can be implemented using a dedicated “NLP” Augmented Phrase Structure (APS) grammar formalism. The APS grammar is used to generate the semantic frames that are passed on to the dialogue manager of a spoken dialogue system. In a derived form, conforming to the HTK Standard Lattice format, the same APS grammar may be used to constrain the approved speech recogniser grapHvite by Entropics. APS and HTK (standard lattice) are just two of several NLP and recognition grammar formats supported by the CPK NLP Suite for Spoken Language Understanding. The suite can be downloaded as C++ source code, for research and other non-commercial use, from http://www.cpk.auc.dk/~tb/nlpsuite.
Similar resources
The CPK NLP suite for spoken language understanding
This paper describes a number of freely available tools for implementing and running spoken language understanding systems. Unlike other free tools (e.g. the CSLU toolkit), the main emphasis is on spoken language understanding (syntactic/semantic parsing, generation of language models for recognition etc.). The suite supports (reads and/or writes) a number of grammar formats defined for speech ...
Automatic Error Correction in a Syntactic Treebank Using Transition-Based Machine Learning
A treebank is one of the most useful resources for supervised or semi-supervised learning in many NLP tasks such as speech recognition, spoken language systems, parsing and machine translation. Treebanks can be developed in different ways, which can generally be categorized into manual and statistical approaches. While the resulting treebank in each of these methods has annotation errors,...
Cross-Domain and Cross-Language Porting of Shallow Parsing
English was the main focus of attention of the Natural Language Processing (NLP) community for years. As a result, there are significantly more annotated linguistic resources in English than in any other language. Consequently, data-driven tools for automatic text or speech processing are developed mainly for English. Developing similar corpora and tools for other languages is an important issu...
Incremental Derivations in CCG
This paper presents a research note on the degree to which strictly incremental derivations (that is, derivations that are fully connected at each point in time) are possible in Combinatory Categorial Grammar (CCG). There has been a recent surge of interest in incremental parsing both from the psycholinguistic community in a bid to build psycholinguistically plausible models of language compreh...
Filtering Errors and Repairing Linguistic Anomalies for Spoken Dialogue Systems
Our work addresses the integration of speech recognition and language processing for whole spoken dialogue systems. To filter ill-recognized words, we design an on-line computing of word confidence scores based on the recognizer output hypothesis. To infer as much information as possible from the retained sequence of words, we propose a bottom-up syntacticosemantic robust parsing relying on a l...